Upper Bound on the Efficiency of 
de-Constrained Codes 

By TA-MU CHIEN 

(Manuscript received June 2, 1970) 

We derive the limiting efficiencies of dc-con strained codes. Given bounds 
on the running digital sum (RDS), the best possible coding efficiency tj, 
for a K-ary transmission alphabet, is 77 = log 2 X mox /log a K, where X m « 
is the largest eigenvalue of a matrix which represents the transitions of 
the allowable states of RDS. Numerical results are presented for the three 
special cases of binary, ternary and quaternary alphabets. 

I. INTRODUCTION 

In digital transmission systems, the transmission channel often does 
not pass dc. This causes the well-known problem of baseline wander. 
One way to overcome this difficulty is to restrict the dc content in the 
signal stream using suitably devised codes. 1-3 As a result many codes 
having a dc-constrained property have been studied. 4-0 The coding re- 
quirement is represented by the constraint put upon the running digital 
sum (RDS) of the coded signal stream. We expect that the efficiency 
of a dc-constrained code is related to the limits of RDS in some defi- 
nite way. This is the subject to which we address ourselves in this 
paper. More specifically, we intend to answer the question: What is 
the best possible efficiency of any dc-constrained code satisfying a 
given limit on RDS? 

Let [a t , a 2 , •••) be the sequence of the transmitted symbols, the 
RDS of the signal stream at instant k is defined to be the sum ^*_j 
a, . Taking the RDS at any instant as the state of the signal stream 
at that point, the limits on RDS define a set of allowable states, and 
each additional signal symbol may be considered as a transition from 
one state to another. This transition can be represented by a matrix- 
called naturally the transition matrix. For a K-ary signal alphabet, 
the best possible efficiency jj of dc-constrained codes is found to be 

V \og 2 K (1) 

where A, nax is the largest eigenvalue of the transition matrix. 
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The efficiency of a code is defined to be the ratio of the average bits 
per symbol of the coded signal stream to that of the random (uncoded) 
signal stream. 

McCullough* has derived the same result (1) for the special cases of 
K = 2 and 3. His approach is quite different from what will be pre- 
sented in the sequel. 

We first describe in detail the construction of a mathematical model 
for the case of a binary alphabet. Then we generalize the result of the 
binary case to include any alphabet set. Methods of effecting numeri- 
cal calculation are discussed as well as approximation formulas. The 
numerical results for three important cases are presented and known 
codes are compared with the theoretical limits. 

II. LIMITING EFFICIENCY OF THE BINARY CODES 

In this section we confine our discussion to binary signals and direct 
our attenion to the intuitive reasoning which leads to the construction 
of a simple mathematical model and its interpretation. 

Let M, a positive integer, be the desired bound on the RDS of the 
coded binary signal stream. This defines a subset £«(«>) of the set 
S(<*>) of all infinite binary sequences in the following way: An infinite 
sequence is in the subset S M (<x>) if the RDS of the sequence is nowhere 
larger than M or less than —M, i.e., E*-i a « I = M for k = 1,2, ••■ . 
A sequence in S M («>) is called an allowable sequence. Denoting by 
Nm ( °° ) and N( °o ) the number of infinite sequences in Su ( °° ) and 8( <» ) 
respectively, the average information per symbol for the sequences 
in £,„(«>) is given by 

_ log 2 N if (™) m 

V \og 2 N(«>) ' K) 

assuming the ratio exists. If we interpret the set S(*>) as source data 
and £.„(°°) as the transmitted signal, then 77 defined in equation (2) 
is the efficiency of a dc-constrained code which maps one-to-one from 
£(*>) onto S M (oo).* 

Clearly for any code which satisfies the requirement that RDS be 
bounded by M, the coded signal stream must be a member of *S W (°°). 
Therefore, the set of allowable infinite sequences defined by any code 
satisfying the desired constraint on RDS must be a subset of S M (<x>). 

* The puzzle of mapping a large set to a small set can be cleared mathematically 
by observing that the cardinality of both S(oo) and Sjj(oo) are that of a 
continuum, and physically by demanding that the transmitter has a higher baud 
than the source. 
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Thus we conclude that the formal expression in (2) indeed gives the 
best possible efficiency for a given bound M. Our next step is to find 
a way to count the number of allowable sequences in S M (<x>). 

Let us start by counting the allowable sequences of finite length L. 
Define an occupancy vector of the allowable states u L , ' denoting the 
transpose of a vector (or matrix), 

u L = [u- M •■■ u ••• u+uY, (3) 

where u k , k = —M, • • • , ilf , is the number of allowable sequences 
of length L with their RDS at end equal to k, i.e., £f_, a, = k. The 
total number of allowable sequences of length L, N U (L) is simply 



N„(L) = Z 



u k 



(4) 



As L — oo, N U (L) 
of length L, N(L) = 2 l 



N M (*>) and the total number of sequences 
> N(°o). Hence we can rewrite (2) as 



i\ = lim 

L->oo 



log 2 N U (L) 



(5) 



Now our job is to find a formula for the number of allowable sequences 
of finite length. 

Suppose we know the occupancy vector u L and we want to calculate 
the occupancy vector u /j+ i . Clearly for any allowable sequence of 
length L + 1, its first L elements must be one of the allowable sequences 
of length L. We generate, therefore, the allowable sequences of length 
L + 1 from that of length L by adding one more binary symbol (+1 
or —1). Therefore, the sequences of length L + 1 in the — ilfth state 
are generated by adding — 1 to the sequences of length L in the — M + 
1st state; the sequences of length L + 1 in the —M + 1st state are 
generated by adding +1 to the sequences of length L in the —ilfth 
state and by adding — 1 to the sequences of length L in the — M + 2nd 
state; etc. It is not difficult to see that the new state occupancy vector is 



U/. + 1 = 



M-Af+l 
U-M + U- M + 2 

"-.W+l T "-if + 3 

ttjf-2 + U M 



(G) 
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Equivalently, u L+l can be written as 






Ut +i = A 


2M+lUi, 




where 








[~° l 









1 


1 




A 2 m+i = 


1 




• 1 









1 



(7) 



(8) 



is a square matrix of size 2M + 1 with ones in the superdiagonal and 
the subdiagonal and zeros elsewhere. A 2 m*i is the transition matrix of 
the allowable states. By the same reasoning, we have, 






(9) 



and 



\ii — A 2 M+i^0 



where Uo is the occupancy vector of the sequence of zero length. It is 
denned naturally with one at the zeroth state and zeros elsewhere, 






1 


0. 
From equation (9) , we obtain, by successive substitution, 



(10) 



(11) 
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The total number of allowable sequences, from equation (4), is 

N U (L) - VAiM+4h (12) 



where 



1 = 



(13) 



The class of matrices A„ defined in equation (8) has many interest- 
ing properties. Their investigation is relegated to the Appendix. 

Using the result derived in the Appendix, and adapting the following 
ordering or eigenvalues of A 2 m*i' 



< \m-i < X, 



X_jtf < A-J1/ + 1 < 

we can rewrite equation (12), 

N U (L) = l'PD L A P'u 



where 



D A = 



X_ ; 



P = 



. 
*o(A-*)/*(X-jr) 

1 (X_,,)/0(X_ w ) 







XjlfJ 

*.(Xjr)/*(Xjr) 

«i(X Jtf )/0(X Af ) 



(14) 



(15) 



(16) 



_4>2 U Qi- U )/4>&-li) 02il/(XA/)/*(XAf)J 

from Lemma 7 of the Appendix and <£(A) and </>i(A) are defined in 
equation (54) and (70). By straightforward multiplication, we can 
write, from equation (14), 



N M (L) = £ xW + i(X«) 2>/(X<), 



(17) 



where the normalization constants <£(Aj) are omitted for simplicity. 
Denote by A max the largest absolute value of the eigenvalues of A„, and 
from Lemmas 3 and 6 of the Appendix we know that 



Amm X«f A_J 



(18) 
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Then from equation (17) and Lemma 2, 

( ?M 2Jl£ "I 

N„(L) = XiUjf+iCXjr) E^A") + C-l) L *jr«(^Jf) E *,(*-") f 

+ E xW+i(W E **(*<) 



— jtf+i 



= X&* + i(X*) E [1 + (-l) Z,+Jf+ ' +, ]0/^) 



jvr— i 2 at 

+ E xW + i(x<) E*,(Xi). (i9) 

,~-Af + l ,-0 

Since <£,(X.u) > for all j (see the proof of Lemma 4), the coefficient 
of the \£ term in equation (19), 

fcr+ifar) E [1 + (-D L+ " + '" + 1 ]0y(X-) > (20) 

independent of L. 

Substituting equation (19) in (5) and using (18), we have 

, = lim^{log 2 x£«L«+ E (r^)V|} 

- log 2 X B „ + lim y log 2 z max + E (r^) *« ( 21 ) 

where z mnx and 2,- are the coefficients of X A / and X,, iV ± M in equation 
(19). The second term in equation (21) approaches zero as a limit 
since z max > 0. Thus we have the desired result for the binary case 

■q = log 2 X max . (22) 

Actually, we have proved a result more general than (22). Observe 
that, in passing to the limit, the crucial point is that z mas in (21) be 
nonzero. From equation (20) and the fact that0,(X. v ) ^ for i ^ 2M, 
we conclude that the particular u we use, though natural, is immate- 
rial, and any vector with non-negative coordinates will serve the pur- 
pose. Observe also the actual values of the allowable RDS state no- 
where enter into our discussion, hence, it is immaterial whether the 
bound on RDS be symmetric or not. We can consolidate our discussion 
by stating the following theorem. 

Theorem 1 : For a binary alphabet, if the RDS of a coded signal 
stream is required to be within some bound M* and Af~, where M + and 
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M~ are integers, then the best possible coding efficiency is given by 

7] = logo X max 

where \ max is the largest positive eigenvalue of the transition matrix A„ 
of size n = M* — M" + 1 as defined in equation (8). 

III. GENERALIZATION TO K-QXy CODES 

We now wish to extend the result derived in the previous section to 
an arbitrary Iv-ary alphabet set, {«i , • • • , a K ). We shall restrict 
ourselves to symmetric alphabets. Namely, if K is even, a,- takes on the 
values -(K - 1), -(K - 3), • • ■ , -1, +1, ■ • • , (K -1); if K 
is odd, ai takes on the values - (K - l)/2, - (K - 2)/2, • • •, -1, 0, 
1, •••, (K — l)/2. The transition matrix of allowable states is then 
given by 



a i S a i < 

where the size of the matrices is 

n = M + - M~ + 1, 



(23) 



(24) 



and M + and M ~ are the desired upper and lower bound on RDS. If 
a, = is a member of the alphabet, we follow the usual convention 
that H° n = l n . The matrices 



H.= 







I 







(25) 



and 



F n = 




1 

I 





(26) 



1 0j 

are known as superdiagonal and subdiagonal matrices respectively. To 
see that A„ given in equation (23) is indeed the transition matrix, we 
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observe that each symbol ai will generate a sequence in any allowable 
state to a state oj unit away. Each term in (23) represents the 
transition of states due to a particular alphabet. As an example, tak- 
ing the quaternary alphabet set {-3, -1, +1, +3}, the transition 
matrix is 



A n = 






1 





1 







1 





1 





1 







1 





1 






1 




1 


1 






• 1 



1 









1 





1 o_ 



With these preliminaries out of the way, we now state a general re- 
sult on the limiting efficiency of dc-constrained codes: 

Theorem 2: If the RDS of the coded K-ary signal stream is required 
to be within some bound M r and M~, then the best possible coding 
efficiency is given by 



_ 10g2 Aim* 

71 " log 2 K 



(27) 



where A. mnx is the largest eigenvalue of the transition matrix A n de- 
fined by equations (23) and (24)- 

Before we embark on the proof of Theorem 2, we need to establish 
an important auxiliary result. 

Let N M (L) denote again the number of allowable K-ary sequences, 
then the limiting efficiency ??, corresponding to equation (5), is 



log 2 N M (L) 
* = \Z L\o g2 K 



(28) 



In the set of allowable sequences S M (L), we can define a subset 
Su\ mi (jL) by restricting the first symbol to be a,- . Similarly we define 
a subset S M \ a , , — <*i(L) by restricting the first two symbols to be 
a, and —a { . Clearly 



S(L) D S M (L) D S jV ,l..(L) D S M \*i.-.i(L) , 



(29) 



and it follows that 



> 



> 



V ^ Vai ^ V^i.-at 



(30) 
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where rj Ql and i) ai ,- ai are the limiting efficiencies given by equation 
(28) with the additional restriction on the leading elements. 

Considering now all the sequences in S M \ ai .- ai (L + 2), it is not 
difficult to see that the number of sequences in Su\ at ,- ai (L + 2) is 
equal to that in Sm(L). Hence the efficiency, 

log 2 N„(L) 



Vai.-a = li m 



£-00 l0g 2 K 



£ + 2 



Coupled with equation (30), wc have shown that 

V = Vd = Vai.-ai • (32) 

A little reflection should convince us that any finite pattern at the 
beginning of the sequences does not affect the limiting efficiency -q. 
In other words, the limiting efficiency is independent of starting point 
— a fact we observed in the previous section after the detailed study 
of the transition matrix. This fact enables us to prove Theorem 2 
without going through a tedious mathematical analysis. 

Proof of Theorem 2: The matrix A n defined in equation (23) is 
real and symmetric. It can be diagonalized by an orthogonal trans- 
formation, i.e., 

A n = PD A P', PP' = 1 (33) 

where D A is a diagonal matrix of real elements Ai , • • ■ , A.„ . 

Using any u , a constant vector with non-negative elements, we 
can generate a sequence of vectors u L , 

u L = 4 *u , for L = 1,2, ... . (34) 

Since A n is a matrix with non-negative elements, it is easy to see that 
all Ul's are vectors with non-negative elements. Write 

P = [P.p 2 • • • PJ (35) 

where p, is a column vector. From equation (34) we have, using u 
with only a 1 in jth. position, 

u L = PD L A P'u 

= Zxfpy.P. (30) 

where p,, is the ;th element in p, . 
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Let X max denote the absolute value of the largest eigenvalue of A n 
and assume that, in general,* 

X] = X 2 — " ' = Xr = X mn xi (^7) 

X r + i = X r + 2 = • ■ • — X r + Jl = — X max . 

We can rewrite (36) 

{ i=1 »-r+l i-r+»+l \Amax' ^ 

Denote by z the first two sums in equation (38), 

*= Zp,,p. + (-D l Ep* 09) 

i-l t-r + 1 

z must be non-negative for any j and L. If not, then for some large 
enough L, u L will have negative elements, which is a contradiction. 
Since z is a linear combination of p x , ■ ■ • , p r+s , a set of linearly inde- 
pendent vectors, z = only if p it = 0, i = 1, • • ■ , r + a. Furthermore, 
if p ti = for all j = 1, • • • , n, then the transformation matrix P has a 
row of zeros, which is again a contradiction. Thus we conclude that, 
for some choice of u , i.e., for some j, 

u L -XjJ*+ t, (r^P.) (40) 

I I «=r + » + l \A mnx -/ J 

with z non-negative independent of L. The total number of allowable 
sequences is 



X, 



N M (L) = Xl x l'z+ L [=*- U.l'p.j- (41) 

V i=r+»+l \ ft raas' J 

Substituting N U (L) in equation (28), and passing to limit, we get 
equation (27). The proof is now complete. 

IV. NUMERICAL RESULTS AND DISCUSSIONS 

4.1 Numerical Calculation 

Using the digital computer, the calculation of X max of any transition 
matrix A„ is not difficult except maybe for large n. In the following, 
we discuss several alternative approaches to evaluating A mnx , and 
we present results for three important cases. 



* It can be shown that r = 1 and s = or 1. But the proof that follows does 
not require this fact. 
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(i) Find \ mBX by direct diagonalization of the matrix A n . There 
are computer programs developed for this purpose. This is done for 
the binaiy case and the quaternary case, and the limiting efficiency 
■q is plotted (solid curve) as a function of allowable states n in Figs. 
1 and 2 respectively. 

(n) In the binary case, the characteristic polynomial <f>„(\) of 
A n satisfies a simple recursive relation (56) . Treating (56) as a differ- 
ence equation of <£„(A)'s, one can express <£ n (A) in an alternate form: 4 



0»(X) - 



sin [(ro + 1) cos~' (A/2)] 



sin [cos -1 (X/2)] 
The roots of <£„(A) are, as easily seen in equation (42), 



X t = 2 cos 



KIT 



k = 1, ■ •• ,n. 



(42) 



(43) 



n + 1 ' 
(Hi) The ternary case can be reduced to the binary case by replac- 
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10 15 20 25 30 35 40 45 50 

n, NUMBER OF ALLOWABLE RDS STATES* 



Fig. 1 — Limiting efficiency vs allowable states binary alphabet (+1, — 1). 
*n = M* — M~ + 1, where M* and M~ are the upper and lower bound of 
the RDS. 
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5 10 15 20 25 30 35 40 45 50 

n, NUMBER OF ALLOWABLE RDS STATES* 

Fig. 2— Limiting efficiency vs allowable states quaternary alphabet (+3, +1, 
— 1, —3). *n = M* — M~ + 1, where M + and M' are the upper and lower 
bound of RDS. 

ing A in <f> n (\) by A + 1. Therefore, one gets A max of the ternary case 
by adding 1 to the corresponding (same n) \ mnx of the binary case. 
The top curve in Fig. 3 is plotted in this way. 
[iv) From the well-known formula 10 



A max = max x'A„x 



(44) 



where 



x = 



Xj 



JBn_ 



and the norm of a vector ||.t||, is 



(45) 



(46) 
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we have 



Ama* = max \ X) £ XiX i+at + 2 Jwj-«| (47) 

llx|| = l U.gO .-1 <*,<0 ."-I 



where the «i's are members in the alphabet set. For example, 

n-l 

Xmax = max 2 J2 XiXi+i, 

flx||-l . = 1 



llxll -1 V..-1 



in the quaternary case. 



(48) 



in the binary case ; 

\ max = max { 2 x* + 2 £ x,z, +1 f (49) 

in the ternary case ; and 

Jn- t n-3 

Xmax = max 2\ X) z.z.+i + 2 z.z.+af (50) 
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5 10 15 20 25 30 35 40 45 50 

n, NUMBER OF ALLOWABLE RDS 5TATE5* 

Fig. 3 — Limiting efficiency vs allowable states ternary alphabet (+1, 0, —1). 
*n = M* — M~ + 1, where M* and M~ are the upper and lower bound of 
the RDS. 
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In this formulation, A mnx becomes the extreme value of a quadratic 
form with an equality constraint. There are a number of ways to 
effect a numerical solution. 

4.2 Approximation Formula 

To search for A„ I1IX using equation (47) is not an easy alternative, 
but it leads to an estimation of A mns . If we let Z\ = x 2 = ••• = x„ , 
then, from equation (47), we have 

oiSO ' t ai<0 '• 

Any other choice of the xjs will lead to a different estimate of A mnx 
which may be better or worse than that of equation (51). We justify 
the present choice by noting the simplicity of equation (51). Using 
equation (51), we obtain an approximation formula for the limiting 
efficiency 77, 

i I \T» Tl ~~ a i 1 V* n "fc a > 

log 2 2, — — + 2. —r- 



' log 2 K 

where ai 's are members of the K-aiy alphabet set. The approximate 
v are also plotted in Figs. 1 to 3 (dashed curve) . 

As expected, the approximation is reasonably good for large n and 
it's for large n that we may have to rely on the approximation formula. 

4.3 Discussion 

(i) It is of some interest to see how the efficiencies of various codes 
with the dc-constrained property compare with the limiting curves. 
We have located the following known codes: 

ZDN (Zero-Disparity Binary Code of Block Length N) 5 and 

LDN (Low-Disparity Binary Code of Block Length N) 8 in Fig. 1; 
and 

BP (Bipolar Code) 1 , n = 2, v = 0.63*, 

PST (Paired Selected Ternary Code) 7 , n = 4, v = 0.63, 

BNZS (Bipolar with N Zero Substitution Codes) 8 , 
n = 4, rj = 0.63, 

VL43 (Variable Length Ternary Code)", n = 5, v = .84, and 

MS43 (Fixed Length Ternary Code) 9 , n = 6, v = 0.84, in Fig. 3. 



*N = 6. The allowable states for BP are —1 and or and +1 depending 
upon whether the first pulse transmitted be —1 or -f-1. A similar situation exists 
for PST, BNZS. 
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The BP and BNZ* codes are used in T-l and T-2 systems 1 - 2 re- 
spectively and PST is used in the experimental T-4 system. 3 In 
comparison with their limiting efficiencies, one gets the impression that, 
barring the fact that these codes have other properties in addition to 
the dc-constraint, there is some room for improving the coding ef- 
ficiency. VI.43, and MS43 are examples along this direction. 

It should be pointed out" that the real engineering problem is to 
control baseline wander. A true comparison of different coding schemes 
should therefore be done on this basis. The relation between RDS 
and baseline wander is an elusive one. It depends upon the detailed 
structure of the code in question and the channel to which the signal 
is applied. In terms of RDS, it depends not only on the bounds and 
the distribution of the allowable RDS but also on its dynamics. By 
dynamics we mean the "speed" of moving from one state to another, 
the dynamic behavior is of importance because the channel has 
"failing memory", so to speak. For example, it can be shown 11 under 
certain conditions that a quick jump from one extreme state to the 
other results in a larger amount of baseline wander than would occur 
from staying in an extreme state for a long time. 

(ii) As a general observation, the limiting curves saturate rapidly. 
This implies that, to make a high efficiency code possible, a physical 
system should be designed to operate beyond the fast rising portion 
of the curves. It would be reasonable then to expect that a simple 
ternary block code could be found with 90 percent efficiency or better 
for, say, n — 15. 

(Hi) An interesting question which arises naturally in connection 
with the limiting efficiency is its readability. If one accepts infinite 
delay, the answer is affirmative. If one thinks in terms of block codes 
of finite length, then the limiting efficiency cannot be realized. 

v. CONCLUSION 

We have shown that, for a dc-constrained code, the limiting effi- 
ciency is related to the number of allowable RDS states in a very 
simple way. The result is effective in the sense that it lends itself 
easily to numerical evaluation. 

The underlying mathematical fact in our proof is the property of 
non-negative matrices and vectors. Using the theorem of Frobenius 
on non-negative matrices, 10 our result can be proved in a few steps. 



* The discussion here is heuristic in nature. A thorough treatment of the sub- 
ject is beyond the scope of this paper. 
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We retain our approach for the reasons that it only requires ele- 
mentary knowledge of matrix theory and it gives more insight to the 
problem. 

The technique developed in this paper can be used to investigate 
bounds for some other classes of codes, say timing codes. This will 
be done elsewhere. 
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APPENDIX 

The class of matrices which we want to investigate has the follow- 
ing general form* 



A„ - 



1 

1 1 
1 • 









1 

1 



(53) 



Each matrix A n is a square matrix of size n and it has ones in the 
super- and sub-diagonal and zeros elsewhere. 

Let 4>„(A) denote the characteristic polynomial of A n . By definition, 

n (X) 4 det [Xl n - A n ] 
X -1 



-1 X -1 

<|.-i -1 • 


The first few polynomials <£ n (A) are 







• -1 

-1 X 



(54) 



* A special class of Jacobi matrix. 12 
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0o(X) = 1, 

*i(X) = x, 

^ 2 (X) = x 2 - l, (55) 

fc(X) = X 3 - 2X, 
* 4 (X) = X 4 - 3X 2 + 1, 

where <£ (X) is defined to be 1. The polynomials <f>„(x) have some 
interesting properties. We state them as lemmas. 

Lemma 1 : <f>„(x) satisfies the following recursive relations: 
For n ^ 2 

0„(X) = X0 n _x(X) - 0„_ 2 (X), (56) 

and for n > m, 

4>„(X) = <f> m (\)<l> n -M ~ </. n -i(X)<^ n _ m _ 1 (X). (57) 

Proof: To prove equation (56), we expand the determinant in (54) 
with respect to the first column. To prove (57), we expand the 
determinant with respect to the first m columns and observe that the 
only two nonzero products of minors are of size m and n — m. 

Lemma 2: <£„ (X) is an even (odd) polynomial if n is even (odd), i.e., 

<f> n (\) = (-1)>„(-X). (58) 

Proof: Assume that (58) is true for <£„-i(A) and <£„- 2 (X), then by 
(56), 

0„(-X) = -\4> n -i(-\) - 0»- 2 (-X), 

= (-i)"X0 n -!(x) - (-lrwx), 
= (~inx* B -i(x) -0„_ 2 (x)], 

= (-1)0,.(X). 

Since (58) is true for <j> (\) and <£i(X) by inspection of (55), (58) is 
true for any n by induction. 

Lemma 8: If X is a root of <£„(X) and X 9^ 0, then — X is also a root 

o/0„(X). 

Proof: Follows directly from the previous lemma. 
By definition, the roots of <£„(X) are the eigenvalues of matrix A n . 
Since A n is real symmetric, it follows that all the roots of <£„(X) are 
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real. Let Xj£i denote the root of <£ n (X) with largest absolute value. In 

view of the result in Lemma 3, Xj[« can always be taken to be positive. 

Lemma 4-' For any finite n, X^"i of <£ n (X) has the folloiving ordering: 

\±ll < \£L < < K:l < 2. (59) 

Proof: We will prove this lemma again by induction. Clearly, </>„(X) — * 
oo as X — » oo for any n = 1, 2, • ■ • . It follows that 4>„(\) > for X > 
X^« the largest positive root of <£„(X). Now, assume that (59) holds for 
Xi""" and X^; 2) , then, from (56), 

0„(Xi"- 1) ) = X^; l, n _ 1 (Xi"- 1> ) - n _ 2 (Xi"- ,) ) (60) 

= -in^KZ") < 0. 

Hence <£„(X) changes sign at least once between X^"" 1 ' and oo. This 
implies that 

Xm« > xi:; 1 '. (6i) 

Since (59) holds for n = 1 and 2, it holds for any n. 

To show that X^"* < 2 for any finite n, we make the observation 
that for X > 2, the matrix Xl n — A„ is a dominant matrix, 13 and there- 
fore nonsingular. For X = 2, 

4> n (2) = w0:(2) - (» - 1)0 O (2), (62) 

= n + 1 ?* 0, 

by repeated use of the recursive relation (56). 

Lemma 5: For some number X , if <£ n (X ) = 0, then <£ n _i(X ) 5^ 0. 
Proof: Assume the contrary, i.e., 

tf, n (X ) = * B _ a (X ) = 0. (63) 

Then from the recursive formula (56), 

0„_ 2 (X O ) = 0. (64) 

Repeating the same argument, we conclude 

n (X o ) = • • • = 0i(X o ) = <Ao(X ) = (65) 

which is impossible. 

Lemma 6: 0„(X) has only simple roots. 

Proof: Let A n be a root of 4> n (\), then the matrix [X 1„ — A n ] is 
singular. On the other hand, from the previous lemma, [A l n -i — 
A n -i] is nonsingular. Hence the null space of the matrix [Xol B ~ A n ] 
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is one-dimensional. From the fact that A n is diagonalizable, we con- 
clude that A must be a simple root of <j>„ (A) . 
Lemma 7: Write 



where 



and 



A n = PD A P' 
D A = diag [X, , • • • , X„] 



PP' = 1; 
then in general, P can be expressed in terms of <f>{k)'s 

0n(X,) n (X a ) n (X„) 



(66) 
(67) 

(68) 



P = 



0(X,) 0(X,>) <t>(\„) 



n-l(X,) „-,(X,,) 4>„-,(X„) 

L «(x,) 0(x 2 ) 0(x„) J 



(69) 



where 



*(X) - 



-DM 1 



(70) 



is a normalization constant. 



Proof: Let x be an eigenvector corresponding to an eigenvalue 
Xof^ n , 



Write 



and expand (71), we have 



A n x = Xx. 
x = [a, , • • ■ , x n ]' 

X 2 = X.Ti , 

x 3 = Xx-, — X\ , 

•T-4 = AX$ — Xn 



(71) 



(72) 



x n = X.r„_i — .r n _ 2 , 
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and 

X n -i = \x n . 
Delete the last equation in (72) and then compare with equations (55) 
and (56). We can make the following identification: 



Xi = # (a)i 
x 2 = <f>i (a) 



(73) 



x n = 0„-,(X). 
To normalize x, we divide (73) by the inner product of x. Denoting 
the inner product by <f> (A) , 

the normalized eigenvector corresponding to the eigenvalue A is 
given by 

[" ♦q(a) ♦-ift)T / 75) 

Uoo ' ' " ♦w J 

It is well known that eigenvectors corresponding to different eigen- 
values are orthogonal. Therefore, for P as defined in (69), PT = 1«. 
Since each column of P is an eigenvector of A n , it follows that 

A n P = PD A 
and equation (66) is immediate. 
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